A reliability study for evaluating information extraction from radiology reports.

نویسندگان

  • G Hripcsak
  • G J Kuperman
  • C Friedman
  • D F Heitjan
چکیده

GOAL To assess the reliability of a reference standard for an information extraction task. SETTING Twenty-four physician raters from two sites and two specialties judged whether clinical conditions were present based on reading chest radiograph reports. METHODS Variance components, generalizability (reliability) coefficients, and the number of expert raters needed to generate a reliable reference standard were estimated. RESULTS Per-rater reliability averaged across conditions was 0.80 (95% CI, 0.79-0.81). Reliability for the nine individual conditions varied from 0.67 to 0.97, with central line presence and pneumothorax the most reliable, and pleural effusion (excluding CHF) and pneumonia the least reliable. One to two raters were needed to achieve a reliability of 0.70, and six raters, on average, were required to achieve a reliability of 0.95. This was far more reliable than a previously published per-rater reliability of 0.19 for a more complex task. Differences between sites were attributable to changes to the condition definitions. CONCLUSION In these evaluations, physician raters were able to judge very reliably the presence of clinical conditions based on text reports. Once the reliability of a specific rater is confirmed, it would be possible for that rater to create a reference standard reliable enough to assess aggregate measures on a system. Six raters would be needed to create a reference standard sufficient to assess a system on a case-by-case basis. These results should help evaluators design future information extraction studies for natural language processors and other knowledge-based systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Annotation for Information Extraction from Mammography Reports

Inter and intra-observer variability in mammographic interpretation is a challenging problem, and decision support systems (DSS) may be helpful to reduce variation in practice. Since radiology reports are created as unstructured text reports, Natural language processing (NLP) techniques are needed to extract structured information from reports in order to provide the inputs to DSS. Before creat...

متن کامل

The Opinions of physicians about Radiology Reports

Background & Aims: Radiology reports are often the only means of communication between radiologists and physicians. Despite the importance of these reports, practically physicians do not give a feedback about radiology reports. This study was aimed to determine the opinions of specialists towards radiology reports. Methods: in this descriptive study, sample consisted of all specialists working ...

متن کامل

Design and evaluation of an ontology based information extraction system for radiological reports

This paper describes an information extraction system that extracts and converts the available information in free text Turkish radiology reports into a structured information model using manually created extraction rules and domain ontology. The ontology provides flexibility in the design of extraction rules, and determines the information model for the extracted semantic information. Although...

متن کامل

Applying activity-based costing to determine the final costs of radiology services in Shiraz, Iran: a concurrent equations approach

The Activity-Based Costing (ABC) method possesses the capability to identify the costs accurately and provide non-financial information for improving system function and increasing its efficiency. The present study aimed to calculate the final costs of radiology services to determine the final costs deviation from the enacted tariffs. This study was a retrospective cross-sectional analysis of r...

متن کامل

Annotation of Entities and Relations in Spanish Radiology Reports

Radiology reports express the results of a radiology study and contain information about anatomical entities, findings, measures and impressions of the medical doctor. The use of information extraction techniques can help physicians to access this information in order to understand data and to infer further knowledge. Supervised machine learning methods are very popular to address information e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of the American Medical Informatics Association : JAMIA

دوره 6 2  شماره 

صفحات  -

تاریخ انتشار 1999